Conversation

@kareemshaik80 (Collaborator) commented Oct 30, 2025

  • restructure moe kernels folder
  • add prepare moe inputs kernels (a reference sketch of what these compute follows this description)
    • compute_problem_sizes
    • compute_expert_offsets
    • compute_expert_blockscale_offsets
    • compute_arg_sorts
    • ShuffleRows
    • ApplyShuffleMulSum


Signed-off-by: kareem <[email protected]>
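
A minimal CPU reference sketch of what these prepare-input kernels compute from the routing output. It treats topk_ids as a flattened [num_tokens * topk] array of expert ids; all names, layouts, and the host-side formulation are assumptions for illustration (the actual kernels run on XPU):

// CPU reference (not the actual SYCL kernels) for the prepare-input step.
#include <cstddef>
#include <cstdint>
#include <vector>

struct MoePrepared {
  std::vector<int32_t> problem_sizes;   // tokens routed to each expert
  std::vector<int32_t> expert_offsets;  // exclusive prefix sum of problem_sizes
  std::vector<int32_t> arg_sort;        // token slots grouped by expert
};

MoePrepared prepare_moe_input_ref(const std::vector<int32_t>& topk_ids,
                                  int num_experts) {
  MoePrepared out;

  // compute_problem_sizes: count how many token slots each expert receives.
  out.problem_sizes.assign(num_experts, 0);
  for (int32_t e : topk_ids) out.problem_sizes[e]++;

  // compute_expert_offsets: exclusive prefix sum gives each expert's start row.
  out.expert_offsets.assign(num_experts + 1, 0);
  for (int i = 0; i < num_experts; ++i)
    out.expert_offsets[i + 1] = out.expert_offsets[i] + out.problem_sizes[i];

  // compute_arg_sorts: permutation that groups token slots by expert, in order.
  // ShuffleRows would then gather activation rows with this permutation, and
  // ApplyShuffleMulSum would scatter expert outputs back, weight, and reduce.
  std::vector<int32_t> cursor(out.expert_offsets.begin(),
                              out.expert_offsets.end() - 1);
  out.arg_sort.resize(topk_ids.size());
  for (std::size_t i = 0; i < topk_ids.size(); ++i)
    out.arg_sort[cursor[topk_ids[i]]++] = static_cast<int32_t>(i);

  return out;
}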
kareemshaik80 changed the title from "Restructure MoE and add prepare inputs/meta kernel" to "Restructure MoE and add prepare inputs/meta kernel [wip]" on Oct 30, 2025
Signed-off-by: kareem <[email protected]>
kareemshaik80 changed the title from "Restructure MoE and add prepare inputs/meta kernel [wip]" to "Restructure MoE and add routing kernel [wip]" on Oct 30, 2025
kareemshaik80 and others added 5 commits November 3, 2025 08:11
kareemshaik80 changed the title from "Restructure MoE and add routing kernel [wip]" to "Restructure MoE and Add prepare input kernels" on Nov 10, 2025
kareemshaik80 changed the title from "Restructure MoE and Add prepare input kernels" to "Restructure MoE and Add MoE prepare input kernels" on Nov 10, 2025
Signed-off-by: kareem <[email protected]>
Signed-off-by: kareem <[email protected]>
@adityachatter left a comment

LGTM.

airMeng added the run-ci label on Nov 11, 2025
@msinnha1 left a comment

One general comment: also add error handling to the functions.

For example, in void prepare_moe_input():
TORCH_CHECK(topk_ids.dtype() == torch::kInt32, "topk_ids must be int32");
...
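
Expanding on that suggestion, a sketch of such checks; the argument list and shape relations here are assumed for illustration, not the merged signature:

// Hypothetical validation for prepare_moe_input; names/shapes are assumptions.
#include <torch/extension.h>

void prepare_moe_input(const torch::Tensor& topk_ids,
                       torch::Tensor& expert_offsets,
                       int64_t num_experts) {
  TORCH_CHECK(topk_ids.dtype() == torch::kInt32, "topk_ids must be int32");
  TORCH_CHECK(topk_ids.is_contiguous(), "topk_ids must be contiguous");
  TORCH_CHECK(num_experts > 0, "num_experts must be positive");
  TORCH_CHECK(expert_offsets.numel() >= num_experts + 1,
              "expert_offsets must hold at least num_experts + 1 entries");
  // ... launch the actual prepare-input kernels ...
}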

kareemshaik80 changed the title from "Restructure MoE and Add MoE prepare input kernels" to "Add MoE prepare input kernels" on Dec 10, 2025
Signed-off-by: Shaik, Kareem M <[email protected]>
Signed-off-by: Shaik, Kareem M <[email protected]>
@airMeng (Collaborator) commented Dec 10, 2025

@kareemshaik80 please rebase with the latest main

airMeng requested a review from mingfeima on December 11, 2025
@airMeng (Collaborator) left a comment

Generally follows the SGLang CUDA kernels; LGTM except for some minor comments.

@mingfeima (Collaborator) left a comment

Good job on this one!

Just a few minor places to change, and then it should be fine.

@mingfeima (Collaborator) commented

@kareemshaik80 Could you please also collect what fraction of total kernel time the activation shuffling accounts for in our benchmarks? I want to understand the overhead this creates for MoE layers.
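
One way to collect that ratio, sketched with SYCL event profiling; the submit_* entry points are hypothetical stand-ins for the repo's actual kernel launches:

// Measuring a kernel's share of MoE layer time via SYCL event profiling.
// The queue must be created with enable_profiling for this to work.
#include <sycl/sycl.hpp>

static double elapsed_ms(const sycl::event& e) {
  auto t0 = e.get_profiling_info<sycl::info::event_profiling::command_start>();
  auto t1 = e.get_profiling_info<sycl::info::event_profiling::command_end>();
  return static_cast<double>(t1 - t0) * 1e-6;  // nanoseconds -> milliseconds
}

// Usage idea (submit_shuffle / submit_rest_of_layer are hypothetical):
//   sycl::queue q{sycl::property::queue::enable_profiling{}};
//   sycl::event shuffle_ev = submit_shuffle(q, ...);
//   sycl::event rest_ev    = submit_rest_of_layer(q, ...);
//   q.wait();
//   double shuffle_ms = elapsed_ms(shuffle_ev);
//   double ratio = shuffle_ms / (shuffle_ms + elapsed_ms(rest_ev));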

Signed-off-by: Shaik, Kareem M <[email protected]>
@airMeng (Collaborator) left a comment

Signed-off-by: Shaik, Kareem M <[email protected]>
mingfeima merged commit ac9e2a7 into sgl-project:main on Dec 12, 2025
2 of 3 checks passed
@mingfeima (Collaborator) commented

@kareemshaik80 Good work here! Please continue analyzing how much overhead the shuffle takes; you can share the data internally.

airMeng added a commit that referenced this pull request Dec 12, 2025
airMeng added a commit that referenced this pull request Dec 12, 2025
kareemshaik80 added a commit to kareemshaik80/sgl-kernel-xpu that referenced this pull request Dec 12, 2025
airMeng pushed a commit that referenced this pull request Dec 12, 2025
* Revert "Revert "Add MoE prepare input kernels (#29)" (#57)"

This reverts commit eb9cfca.

Signed-off-by: Shaik, Kareem M <[email protected]>
sspintel pushed a commit to sspintel/sgl-kernel-xpu that referenced this pull request Jan 2, 2026
* Restructure MoE and add prepare inputs/meta kernel
 - restructure moe kernels folder
 - add prepare moe inputs kernel

Signed-off-by: kareem <[email protected]>

* fix minor issues

Signed-off-by: kareem <[email protected]>

* Add tests

Signed-off-by: kareem <[email protected]>

* Add shuffle_rows Kernel

Signed-off-by: kareem <[email protected]>

* register shuffle_rows

Signed-off-by: kareem <[email protected]>

* Enable Build and Add apply_shuffle_mul_sum kernel

Signed-off-by: kareem <[email protected]>

* functional

Signed-off-by: Shaik, Kareem M <[email protected]>

* cleanup

Signed-off-by: kareem <[email protected]>

* cleanup1

Signed-off-by: kareem <[email protected]>

* Modify fused expert to invoke moe_kernels and increase test coverage

Signed-off-by: Shaik, Kareem M <[email protected]>

* Cleanup makefile

Signed-off-by: Shaik, Kareem M <[email protected]>

* remove debug code

Signed-off-by: Shaik, Kareem M <[email protected]>

* fix lint

Signed-off-by: Shaik, Kareem M <[email protected]>

* Fix review comments

Signed-off-by: Shaik, Kareem M <[email protected]>

* Add to CI

Signed-off-by: Shaik, Kareem M <[email protected]>

---------

Signed-off-by: kareem <[email protected]>
Signed-off-by: Shaik, Kareem M <[email protected]>
sspintel pushed a commit to sspintel/sgl-kernel-xpu that referenced this pull request Jan 2, 2026
sspintel pushed a commit to sspintel/sgl-kernel-xpu that referenced this pull request Jan 2, 2026
* Revert "Revert "Add MoE prepare input kernels (sgl-project#29)" (sgl-project#57)"

This reverts commit eb9cfca.

Signed-off-by: Shaik, Kareem M <[email protected]>